Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
4062 | 145 | 59,5% |
6183 | 91 | 1,5 |
8102 | 65 | 2,5 |
10229 | 49 | 3,5 |
11171 | 44 | 1,2 |
12059 | 40 | 1,6 |
13362 | 35 | 4,5 |
14299 | 32 | 1,3 |
15408 | 29 | 5,5 |
17863 | 24 | 1,4 |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
137192 | 1 | .) |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
3675 | 162 | 59.5% |
3890 | 152 | 50% |
4062 | 145 | 59,5% |
4761 | 123 | 10% |
4904 | 118 | 20% |
5535 | 104 | 100% |
5880 | 97 | 70% |
5881 | 97 | 80% |
5920 | 96 | 60% |
6305 | 89 | 90% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
8857 | 59 | R&B |
31804 | 11 | R&B/Hip-Hop |
33322 | 10 | A&M |
52810 | 5 | A&E |
52822 | 5 | AT&T |
54887 | 5 | L&PM |
61647 | 4 | C&A |
63339 | 4 | H&M |
65102 | 4 | P&D |
65586 | 4 | R&B/Hip |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
72292 | 3 | 000$000 |
81754 | 3 | R$5 |
93649 | 2 | 472.000:000$000 |
94029 | 2 | 900$000 |
115099 | 2 | R$1 |
120303 | 2 | US$1 |
120304 | 2 | US$2 |
120305 | 2 | US$20 |
120306 | 2 | US$30 |
120307 | 2 | US$50 |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
324 | 1556 | ." |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
6215 | 91 | d'água |
7075 | 77 | Don't |
9227 | 56 | I'm |
10253 | 49 | It's |
10773 | 46 | .' |
13979 | 33 | Can't |
16001 | 28 | Women's |
18623 | 23 | O'Brien |
24257 | 16 | King's |
24495 | 16 | You're |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
42667 | 7 | CD+DVD |
49542 | 6 | UTC+1 |
60390 | 4 | 1+1 |
66815 | 4 | UTC+2 |
82141 | 3 | Rio+20 |
89717 | 3 | n+1 |
93167 | 2 | 2+2 |
97172 | 2 | Blohm + Voss |
97866 | 2 | C+C |
100537 | 2 | DVD+CD |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
379 | 1337 | km/s |
1258 | 465 | hab/km² |
2799 | 214 | e/ou |
3845 | 154 | km/h |
5580 | 104 | residências/km² |
14749 | 31 | S/A |
22962 | 17 | 1/3 |
23682 | 17 | http://www |
23977 | 16 | 1/4 |
25144 | 15 | 2/3 |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _

Depending on the language, some of these characters may be allowed within words while others are not. If words containing forbidden characters do not have a very low frequency, this may point to a problem in preprocessing.
Example query for words containing %:
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
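
The same check can be repeated for every character in the list. The following is a minimal sketch, not the tooling actually used for this report: it assumes a `words(w_id, freq, word)` table like the one in the query above, loaded into SQLite for illustration. The helper names `words_containing` and `suspicious_chars`, the `max_freq` threshold, the `ORDER BY w_id` clause, and the toy rows are all hypothetical. It uses `instr()` instead of `LIKE`, so `%` and `_` need no escaping.

```python
import sqlite3

# Special characters listed in the text above.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words containing `char`.

    The w_id offset of 100 mirrors the query in the text; ordering by
    w_id is an assumption (w_id is taken to follow the frequency rank).
    """
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND instr(word, ?) > 0 "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (char, limit)).fetchall()


def suspicious_chars(conn, forbidden, max_freq):
    """Map each forbidden character to its hits with frequency above max_freq."""
    report = {}
    for char in forbidden:
        hits = [row for row in words_containing(conn, char) if row[1] > max_freq]
        if hits:
            report[char] = hits
    return report


if __name__ == "__main__":
    # Toy rows for illustration only; w_id is rank + 100.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, freq INTEGER, word TEXT)")
    conn.executemany(
        "INSERT INTO words VALUES (?, ?, ?)",
        [(3990, 152, "50%"), (8957, 59, "R&B"), (137292, 1, ".)"), (200, 5000, "casa")],
    )
    for char in SPECIAL_CHARS:
        print(repr(char), words_containing(conn, char))
    print(suspicious_chars(conn, ["(", ")", "%"], max_freq=10))
```

Which characters count as forbidden, and what frequency counts as "very low", depends on the language and corpus size, so both are left as parameters.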